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Abstract. We provide a probabilistic approach to studying minimal surfaces 
in R 3 . After a discussion of the basic relationship between Brownian motion 
on a surface and minimality of the surface, we introduce a way of coupling 
Brownian motions on two minimal surfaces. This coupling is then used to 
study two classes of results in minimal surface theory, maximum principle- 
type results, such as weak and strong halfspace theorems and the maximum 
principle at infinity, and Liouville theorems. 



1. Introduction 

The past several years have seen sustained interest in the theory of minimal sur- 
faces in R 3 . We present an approach to studying minimal surfaces using Brownian 
motion and the methods of martingale theory. We begin with a discussion of the 
basic relationship between Brownian motion on a minimal surface and the coor- 
dinate functions and the Gauss map, particularly in the cases when the surface is 
either stochastically complete with bounded curvature or (geodesically) complete 
and properly immersed. We then introduce a way of coupling Brownian motions 
on a pair of minimal surfaces such that the particles are "encouraged" to couple in 
finite time. We apply this coupling to two classes of results for minimal surfaces, 
maximum principle-type theorems, by which we mean weak and strong halfspace 
theorems and the maximum principle at infinity, and Liouville theorems. 

In both cases, we discuss the relationship between the results we obtain from 
the coupling and the existing results and conjectures obtained by non-stochastic 
methods of geometric analysis. Further results (beyond what we have been able to 
prove here) using the coupling appear to be possible, subject to obtaining a better 
understanding of the behavior of the process when the particles are close. The 
reader who is primarily interested in the geometric consequences of the coupling is 
encouraged to proceed to Section [5j where the topics just mentioned are discussed 
in detail. 

In addition to the particular results we are able to prove, there are a few more 
general reasons for introducing this approach to minimal surfaces, aside from the 
subjective claim that probabilistic methods are intuitively appealing. First, it pro- 
vides a common framework for studying maximum principle-type theorems and 
Liouville theorems. Second, there is reason to think that such a probabilistic 
framework might make it easier to extend results from minimal surfaces to minimal 
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surfaces-with-boundary. This is because Brownian motion can simply be stopped 
at the boundary, and prior to hitting the boundary it is governed by the local ge- 
ometry just as in the boundary-less case. For one example of such an extension, 
see Theorems [81 and [TOl and their proofs. 

I would like to thank Dan Stroock for numerous helpful discussions about Brow- 
nian motion and geometry and for comments on an earlier draft of this paper. I 
would also like to thank Ioannis Karatzas for advice on solving martingale problems 
(in particular, the proof of Theorem 6.1 of [5] provided an outline for the proof of 
Theorem[7]below) and Michel Emery for comments on an earlier draft of this paper. 
Finally, I am grateful to Bob Finn for introducing me to minimal surfaces several 
years ago. 



2. Brownian motion on a minimal surface 

2.1. Basic results. We begin by considering Brownian motion on a connected 
minimal surface M. This is interesting in its own right, and it will also be important 
for understanding coupled Brownian motion later on. There are several ways to 
think of Brownian motion on a manifold. For our purposes, we will think of it as the 
solution to the martingale problem corresponding to one-half the Laplacian on M; 
we now explain what this means in more detail. In general, Brownian motion may 
only be defined until an explosion time, which we denote by e. Let (C[0, e(w)), B) 
be the space of continuous paths on M, which we allow to escape to infinity at some 
time e(w) £ (0,oo] depending on the path ui, with the Borel er-algebra; we give the 
space the topology of uniform convergence on compacts. Let {2? tAe ,i > 0} be the 
filtration generated by these continuous paths. Then Brownian motion started at 
x 6 M is a probability measure on (C[0, e(ui)),B), which we denote P x , such that 

p x Mo) = x] = i 

/ ,iAe j X 

and I h(u(t A e)) — J -Ah(uj(s)) ds; B t /\ e J is a P^-martingale, 

for any smooth, compactly supported function h on M. Throughout this paper, all 
of our martingales will be continuous, and having established this convention we 
will refer to them simply as "martingales." 

Minimal surfaces are characterized by the fact that the restrictions of the coor- 
dinate functions to the surface are harmonic. From a probabilistic point of view, 
this means that the compositions of the coordinate functions with Brownian motion 
are local martingales. More concretely, consider any Euclidean coordinate system 
(xi, X2, X3) on R 3 . The gradient of any coordinate xi (in M) is the projection of 
the gradient of a;, in R 3 onto the tangent plane of M. If we denote the unit normal 
to M by m = (mi, 1112, WI3), then it is easy to check that (Va;*, Vxj) = — minij 
where 5ij is the Kronecker delta function. Let B t be Brownian motion on M. We 
will adopt the usual convention of writing the process obtained by composing a 
function with Brownian motion as / o B t = f t , and when there is no possibility of 
confusion we will sometimes omit the subscript. We see that that the quadratic 
variations and cross variations of the coordinate processes are given by 



(1) 



d{xi,t,Xj,t) = {Sij —mirrij) dt. 
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This determines the evolution of the coordinate processes in terms of the normal 
vector. Another consequence of this equation is that the coordinate processes Xi t tAe 
are true (as opposed to local) martingales. 

Next, we wish to understand the evolution of the normal vector under Brownian 
motion on M. For any smooth surface in R 3 , we can define the Gauss map G : M — > 
§ 2 by identifying the normal vector at every point of M with the corresponding 
point in § 2 . It is well known that, if M is minimal, the Gauss map is conformal 
with conformal factor — yJ\K~\, where K is the Gauss curvature. Recall that a 
map is conformal with conformal factor c (where c can vary over the domain) if 
its differential preserves angles and scales lengths by a factor of c, where c < 
means that the map is orientation-reversing. On a minimal surface K < 0, so the 
Gauss map is orientation-reversing and distorts area by a factor of | K\ . This implies 
that Gt is time-changed Brownian motion on § 2 , with the changed time given by 
ti\K*\ds. 

The introduction of the Gauss map raises the question of orientability of our 
minimal surfaces. We do not wish to restrict ourselves to orientable surfaces. To 
accomplish this, observe that the Gauss sphere process is well-defined (given a 
choice of normal vector at the starting point) whether or not the surface is ori- 
entable, because given any path there is a unique continuous choice of normal 
along it. This allows us to state Theorem Q] below, for example, without regard to 
orientability. From the geometric point of view, this corresponds to the fact that 
Brownian motion lifts to the orientation cover of the surface. With this in mind, 
we freely ignore questions of orientability in what follows. 

In order to say more, we need to place more restrictions on our minimal surface. 
A natural assumption is that M is (geodesically) complete. An example of a theo- 
rem which can be proved in this generality is a result of Osserman that the Gauss 
map of a non-planar complete minimal surface is dense in § 2 (see Theorem 8.1 
of |llj). However, as mentioned above, we are more interested in global control of 
the immersion and on Liouville properties of the surface. If we place no restrictions 
on M beyond completeness, then not much can be said in this direction. In [10j . 
Nadirashvili showed that there exists a complete, minimal, conformal immersion of 
the disk into the unit ball. 

Another natural assumption is that of stochastic completeness. Probabilisti- 
cally, this means that Brownian motion almost surely exists for all time, that is, it 
does not explode by "going off to infinity" in finite time. Analytically, stochastic 
completeness means that the Cauchy initial value problem for one-half the Lapla- 
cian with bounded initial data has a unique bounded solution for all time. Let 
p = \/ x\ + x\ + x\. Then a straightforward computation shows (see section 5.2.2 
of [14]) that ^Ap 2 = 2. It follows from the defining property of the martingale 
problem that 

(2) E [p 2 tAe ] = pi + 2 (t A e) for alii e [0, oo). 

If M is stochastically complete (which is equivalent to e being identically infinite) , 
this equation holds with t A e replaced by t. This obviously prevents M from 
being contained in a ball. Moreover, because the quadratic variation of a single 
coordinate is no greater than t, it also implies that M cannot be contained in an 
infinite cylinder (that is, a set of the form x\ + x\ < C for some C > and some 
choice of orthonormal coordinates on R 3 ). In other words, if M is a stochastically 
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complete minimal surface, then there is at most one element of MP 2 such that the 
projection of M onto that line is not the entire line. 

If we assume only that M is stochastically complete (or even complete and 
stochastically complete), the preceding is as much as we can say in this direction. 
Jorge and Xavier [5] show how to construct complete, minimal, conformal immer- 
sions of the disk into the "slab" {(xx, X2, X3) : —1 < x\ < 1}. It is relatively easy 
to see that their construction can be performed in such a way that the resulting 
minimal surface is also stochastically complete. Thus the above result that, for a 
stochastically complete minimal surface M, there is at most one line in RP 2 such 
that the projection of M onto that line is not the entire line is sharp. 

2.2. Bounded curvature and the weak halfspace theorem. In pi)], Xavier 
proved the weak halfspace theorem for complete minimal surfaces of bounded cur- 
vature; that is, any complete minimal surface of bounded curvature which is not 
a plane is not contained in any halfspace. Our goal here is to prove a differential 
version of the weak halfspace theorem, namely, that the Gauss process accumu- 
lates infinite occupation time in every open subset of the Gauss sphere, almost 
surely. Xavier's weak halfspace theorem as just mentioned is an obvious corollary, 
given the relationship between the normal vector and the quadratic variation of the 
coordinate processes in Equation {]]). 

It is well known that any complete manifold with bounded curvature is stochas- 
tically complete, and thus the assumptions in the theorem below are weaker than 
in Xavier's weak halfspace theorem. 

Theorem 1. Let M be a minimal surface, and assume either that M is recurrent 
or that M is stochastically complete and has bounded curvature. If M is not flat 
(that is, K is not identically zero), then the corresponding Gauss sphere process 
almost surely accumulates infinite occupation time in each open subset of § 2 . 

Proof: First, we observe that recurrent minimal surfaces are easy to handle. Any 
recurrent manifold is necessarily stochastically complete. Now suppose that M is 
a recurrent, non-flat minimal surface. Because M is not flat, there are at least 
two points in § 2 in the image of the Gauss map. By recurrence, the normal vector 
visits a neighborhood of each of these points infinitely often, and thus we see that 
the Gauss sphere process has infinite quadratic variation. Since it is time-changed 
Brownian motion, the Gauss process visits every open subset of § 2 infinitely often. 
For any open subset of S 2 , we can choose some component of its pre- image in 
M. By recurrence, Brownian motion on M accumulates infinite occupation time 
in that pre-image almost surely, and we conclude that the Gauss sphere process 
accumulates infinite occupation time in every open set of S 2 almost surely. 

Now assume that M is transient, not flat, and stochastically complete with 
bounded curvature. Thus its universal cover M is conformally equivalent to the 
unit disk, by the uniformization theorem. It is obviously enough to prove the result 
for M. We will write the metric with respect to the usual Cartesian coordinates on 
the disk as XSij , where 5ij is the Kronecker delta function (so lengths are scaled by 
\/A while area is scaled by A). Here, of course, A is smooth and positive on the open 
unit disk D; also, A determines the time-change taking Euclidean Brownian motion 
on the disk to Brownian motion on M. In particular, the stochastic completeness of 
M is equivalent to the statement that the integral of A along paths of the Euclidean 
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Brownian motion (until the first hitting time of the boundary of the disk) is almost 
surely infinite. 

Let / and g be the Weierstrass data for M, as described in Lemmas 8.1 and 8.2 
of [llj . In particular, g is the stereographic projection of the Gauss map and thus 
meromorphic, and / is holomorphic. Further, we have that (see Equation (8.7) and 
Lemma 9.1 of [llj . and note that our definition of A differs from Osserman's by a 
power of two) 



A 



1/1 (i + lsl 2 ) 



and — K 



4| 5 'l 



1/1(1 + 1.9 



2^ 



We wish to show that the integral of — K along Brownian paths of M is almost 
surely infinite. Note that g t (which, we recall, is goB t where B t is Brownian motion 
on M) is a complex martingale with quadratic variation given by the integral of 
— K along B t . Thus, showing that —K has infinite integral along B t is equivalent 
to showing that g t almost surely does not converge, which in turn is the same as 
showing that g does not have non-tangential limits at the boundary of the disk, 
except possibly on a set of Lebesgue measure zero. This last equivalence is a 
consequence of Doob's form of the Fatou theorem. 

We proceed by contradiction, assuming that with probability 2p > 0, gt does con- 
verge. We may assume that our Brownian motion begins at the center of the disk, 
and thus that the hitting measure of the boundary is proportional to its Lebesgue 
measure. Possibly after a rotation (of our coordinates on R 3 ), we can assume that 
gt has limits with absolute value less than C with probability p, for some positive 
constant C. Because of the above formula for —K, bounded curvature implies that 
there is a constant C such that \g' / f \ < C whenever \g\ < 2C. Because a mero- 
morphic function on the disk has a non-tangential limit at a boundary point if and 
only if it is non-tangentially bounded at that point (again by Doob's version of the 
Fatou theorem), it follows that g'/ f has a non-tangential limit with absolute value 
less than C on a set of boundary points of Lebesgue measure 2np. The set of such 
points where the limit is zero must have Lebesgue measure zero since otherwise 
g' / f would be identically zero, which would mean that g' was identically zero, con- 
tradicting the assumption that M is not flat. So there is a set $ of boundary points 
of Lebesgue measure 2itp where g has non-tangential limits of absolute value less 
than C and g'/f has non-tangential limits with absolute value in (0, C). 

For any Euclidean Brownian path hitting the boundary at $, we see that the 
integral of A along the path being infinite almost surely implies that the integral 
of |/| 2 along the path is infinite, using the above formula for A in terms of the 
Weierstrass data. This in turn almost surely implies that the integral of \g'\ 2 along 
the path is infinite. However, the integral of \g'\ 2 is the quadratic variation of the 
g t (which, we recall, is a complex martingale). We conclude that gt almost surely 
does not converge along Brownian paths which hit the boundary at $. This is a 
contradiction, and thus we have shown that the integral of —K along Brownian 
paths of M is almost surely infinite. 

Because the Gauss sphere process is time-changed Brownian motion on the 
sphere with the time-change given by the integral of —K, this shows that the 
Gauss sphere process hits every open subset of § 2 infinitely often. Now consider B e 
an open ball in S 2 of radius e, and let -B e /2 be an open ball with the same center 
and half the radius. Because —K is bounded from above, there exists some 8 > 
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such that every time the Gauss sphere process hits B e / 2 , it spends time S in B t 
with probability at least 1/2. An easy application of the Borel-Cantelli lemma then 
shows that the process spends an infinite amount of time in B t , almost surely. As 
this argument applies to any open ball in S 2 , the theorem is proved. □ 

We note that this theorem does not require that M is complete. This also 
explains why the hypothesis of the theorem is that M is not flat, rather than that 
M is not a plane. For example, let M be the universal cover of a plane minus 
two points, with the obvious immersion. Then AI is a flat, stochastically complete 
minimal surface, but M is transient and, in particular, not a plane. 

2.3. Proper immersion and the weak halfspace theorem. A weak halfspace 
theorem can also be proved for properly immersed minimal surfaces; namely, if M 
is a complete, properly immersed minimal surface, and M is not a plane, then M 
is not contained in any halfspace. This was first done by Hoffman and Meeks [4], 
using a geometric construction comparing M to the lower half of a catenoid. As 
noted in [7] (see the discussion surrounding Theorems 1.3 and 1.4 there), it is 
also a simple consequence of Theorem 3.1 of [2J, which is proved using elementary 
harmonic function methods. 

Before continuing, we make an observation about terminology. Any properly 
immersed minimal surface is necessarily complete, and thus the phrase "complete, 
properly immersed minimal surface" is somewhat redundant. Nonetheless, we will 
employ this phrase in order to highlight the parallel between them and stochasti- 
cally complete minimal surfaces of bounded curvature, since we are viewing both 
conditions as strengthenings of the corresponding assumptions of (geodesic or sto- 
chastic) completeness. 

We claim that Equation j5J implies that any complete, properly immersed min- 
imal surface is stochastically complete. Brownian motion on a complete manifold 
explodes in finite time if and only if it exits every compact set in finite time. Proper- 
ness of the immersion means that exiting every compact set of M is the same as 
exiting every compact set of R 3 , which is the same as p\ blowing up in finite time. 
However, Equation (|2|) implies that the expectation of p 2 remains finite at all times. 
This justifies our claim. Thus the assumption that M is properly immersed (and, 
necessarily, complete) is a strengthening of the assumption that M is stochastically 
complete. 

Our probabilistic proof is similar, both in spirit and technique, to the harmonic 
function approach mentioned above. Also, we note that we are unable to give a 
stronger, "differential," version as we did for the bounded curvature case (more on 
this below). 

Theorem 2. Let M be a complete, properly immersed minimal surface. If M is 
not flat, then M is not contained in any halfspace. 

Proof: Assume that M is not flat. The case when M is recurrent is already 
covered by Theorem [T] so we assume that M is transient. 

We begin by showing that the integral of 1 — m§ along Brownian motion on 
M blows up almost surely. We will proceed by contradiction; assume that the 
integral of 1 — m\ is bounded with probability p > 0. Let r = \J x\ + x\. Then the 
semi-martingale decomposition of the r t process is 

dr t = Jl-(d r - mf dW t + — (mj + (d r ■ m) 2 ) dt + dL t 
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where Wt is some Brownian motion, and Lt is an increasing process which increases 
only when rt — 0. Also, d r is the ffi. 3 -gradicnt of r (since we can, at least locally, 
identify M with its image in R 3 , there is no problem in thinking of Brownian motion 
on M as a martingale in R 3 and writing the coefficients of the semi-martingale 
decomposition in terms of the geometry of M 3 ). This is the geometer's convention 
of identifying vector fields with first-order differential operators, and we will adopt 
this convention freely in what follows. We will assume, for simplicity, that we start 
our Brownian motion at a point with r > 0. 

We wish to compare the r t process with the 2-dimcnsional Bessel process gener- 
ated by the same Brownian motion, that is, the strong solution of 

1 



df t = dW t + — - dt with f =r > 0. 
Then the process (r — f) t has the semi- martingale decomposition 



d(r-f), = 



1 — (d r ■ m) — 1 



dB t + 



dt+ dL t . 



1 

We will also write this in integrated form as (r — f) t = M t + A t + L t . We wish to 
control the size of excursions of r t above f t . Introduce the stopping time 

ft 



a(C) = inf it > : / (1 



> C 



We will assume that C is chosen large enough so that <j(C) 
at least 3p/4. 

The quadratic variation of M satisfies 



oo with probability 



(M) 



tAcr(C) 



L 



t/\a(C) 



2 1 - J 1 - (d r ■ my - (d r ■ m) 



dr 



< 2 



L 



t/\a(C) 



1 — y 1 — (d r ■ m) I dr. 



Further, because (d r ■ m) < 1 — m\, we see that 

1 — \j\ — (d r ■ m) 2 < 1 — \mz\ < 1 — m\. 



It follows that (M) 



t/\a(C) 



< C. Thus whatever else it is, M, 



tAcr(C) 



is a continuous 



process with both sup i>0 {M tAcr ( C )} and infi> {M tAcr ( C )} almost surely finite. 
Now introduce rj(t) as the random (but not stopping) time defined by 

77(f) = sup{r < t : r T < r T + 1}; 

that is, rj(t) is the last time before t when r has been at or below r + 1. Since 
integrals of the non-martingale terms make sense with random times that aren't 
stopping times, we can write 



(r — r ) 



t/\a(C) 



< 1 + M, 



tAcr(C) 



M, 



v(tA*(C)) 



rtAa(C) r I -1 - r- 

Jr,(tAa(C)) [2r T V 3 ' J 2r T \ L 



r tA<r(C) 



dL 7 



lr,(tAa(C)) 



Note that + (d r ■ m) 2 < 1, and thus that the first integrand is always non- 
positive on the interval of integration. Next, because f t is a 2-dimensional Bessel 
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process, it is almost surely positive for all time. Thus the Lt term never increases 
when rt> ft, and thus never increases on the interval of integration. It follows that 

- f )tA<x(C) ^ 1 + M tA*(C) ~ Af, (tAa(c)) 

Then we conclude that 

sup{r tACT(c) - f tA<T (c)} < 1 + su P{ M tA<x(C)} - inf{M tAff(c) } 

t>0 t>0 *>° 

< oo 

almost surely. 

To continue, note that because 

r-t/\<j{C) 

( x 3)tA,a(c)= J o {l-ml)dT<C, 

we see that sup t > {|x3| tAo .(c)} < oo almost surely. Recall that with probability 
3p/4 > 0, we have a(C) = oo. Thus with probability 3p/4, we have that sup i>0 {r t — 
f t } < oo and sup t>0 {|a;3| t } < oo. Now r t is recurrent for the set f < 1. It follows 
that, on the set of paths with <r(C) = oo, we know that rt returns infinitely often 
to some bounded interval (depending on the particular path) . Since a subset of R 3 
with r and x 3 bounded is contained in a compact cylinder, it follows that the set 
of paths with a(C) — oo returns infinitely often to a compact cylinder (the exact 
cylinder depending on the path) . Because the immersion is proper, the intersection 
of any compact subset of R 3 with M is also compact. Thus, almost every path 
with <t(C) = oo returns infinitely often to some compact subset of M. This implies 
that, with probability 3p/4, Brownian motion on M is recurrent. This contradicts 
our assumption that M is transient, and we conclude that the integral of 1 — m\ is 
almost surely unbounded. 

This means that x^,t is a martingale with almost surely unbounded quadratic 
variation. Thus M is not contained in the halfspace x 3 > 0. Since our choice of 
Euclidean coordinates on R 3 was arbitrary it follows that M cannot be contained 
in any halfspace. □ 

Note that the computation showing that Brownian motion on M is recurrent on 
some compact subset of R 3 doesn't use the fact that the process is restricted to 
a surface. In particular, consider any R 3 -valued (continuous) martingale Y such 
that the diffusion matrix of Y at each instant is given by the diffusion matrix for 
Brownian motion on some plane (if Y is Brownian motion on M, then this plane 
is the tangent plane to M at Y). Further, suppose that one of the coordinates 
of Y has finite quadratic variation with probability p > 0. Then the proof shows 
that, up to a set of probability zero, those paths for which a coordinate of Y is 
bounded are recurrent on some compact subset of R 3 (with the particular subset 
depending on the path). That the corresponding plane- field is integrable (in the 
sense of the Frobenius theorem) and the resulting surface complete and properly 
immersed in the case of Brownian motion on a minimal surface is only used to show 
that recurrence on R 3 implies that Brownian motion on M is recurrent. 

One consequence of this is that the theorem also applies to complete, properly 
immersed branched minimal surfaces. In particular, there are at most countably 
many branch points, so Brownian motion almost surely avoids them all. We then 
see that the above proof applies in this case as well. 

As mentioned, we do not have a "differential" version of this result, as we do in 
the bounded curvature case. In particular, nothing in this theorem rules out the 



A MARTINGALE APPROACH TO MINIMAL SURFACES 



9 



Gauss process spending only finite time in any closed set in RP 2 not containing the 
North/South pole. As we will see later, this difference will mean that we can prove 
more in the bounded curvature case than in the properly immersed case. 

3. Coupled Brownian motion 

To address more sophisticated issues, we will need to do more than study a single 
Brownian motion on a minimal surface. In particular, for any two stochastically 
complete minimal surfaces M and N (where we allow the possibility that they are 
the same surface), we wish to couple Brownian motions on the surfaces such that 
the R 3 -distance between the two particles goes to zero as efficiently as possible. We 
will do so by constructing a diffusion on the product space that, pointwise, gives 
a favorable evolution for the distance between the particles, in a sense to be made 
precise below. 

3.1. Pointwise specification of an optimal coupling. We begin by determining 
what the coupling should be "instantaneously." 

Let x = xt be the position of the Brownian motion on M and y — y t be the 
position of the Brownian motion on N. Let r = r(x, y) be the R 3 -distance between 
x and y. We assume that r(x,y) > 0, since we stop the process if the particles 
successfully couple. The instantaneous evolution of r will depend only on r and 
the positions of the tangent planes T X M and T y N (and the coupling). We wish to 
understand this dependence in detail and use it to see how to choose the coupling. 
As such, we will consider x and y to be given, fixed points and construct our analysis 
around them. 

We can choose orthonormal coordinates (zi, Z2, z%) for R 3 which are well-suited 
to the current configuration (we use Zi for our coordinates here instead of Xi in order 
to avoid overburdening the notation, considering that we use x to denote a point 
in M). Let d Z3 lie in the direction of x — y, where we view y and x as elements of 
R 3 under the corresponding immersions of N and M. Let d Z2 lie in T X M . (As long 
as T X M is not perpendicular to x — y, this will determine d Z2 up to sign.) Finally, 
d Zl is chosen to complete the orthonormal frame. Such a choice of orthonormal 
frame determines orthonormal coordinates on R 3 up to translation. Since we are 
only interested in the relative positions of x and y, any choice of coordinates {z^ 
corresponding to this choice of frame {d Zi } will work. 

Given such coordinates, T X M is determined by the angle it makes with the z\z 2 - 
plane; call this angle 9. We choose an orthonormal basis {d a ,dp} for T X M such 
that, at x, 

VniZi—cosOda, VmZ2 = 9/3, and Vmz 3 = sin 9d a , 

where V MZi is the gradient of zi restricted to M. In other words, dp is in the d X2 
direction, while d a lies in the intersection of T X M and the ziZ3-planc. 

In order to specify the direction of T y N , we will need two angles. Let ip be the 
angle T y N makes with the ziZ2-plane. Let ip be the angle between the intersection 
of T y N with the ziz 2 -plane and d Z2 . Then we can choose an orthonormal basis 
{d a , db} for T y N such that, at y, 

VatZi = cos ip cos tpd a + sin tpdb, Vatz 2 = — cos ip sin tpd a + cos tpdb, 
and Vatz 3 = sin ipd a , 
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This means that db lies in the intersection of T y N and the a;iX2-planc, while d a lies 
along the projection of 8x3 onto T y N . 

Note that, after possibly reflecting some of the Zi and exchanging the roles of M 
and N, we can, and will, assume that 

0e [0,7r/2], [0,7r/2], andVe[0,7r]. 

Further, whenever 9 or p is zero, we can, and will, take ip to be zero. With these 
conventions, there is a unique choice of (9, ip, ip) for every (x, y) € (M x N)\{r = 0}. 
Since the coupling at each point will depend only on these three angles, we will refer 
to them as the configuration of the system at this instant. It is clear that the map 
into the configuration space is smooth near any point where all three angles are in 
the interior of their ranges, but in general will be discontinuous if any of them is at 
the boundary of its range. For the moment, we are concerned only with determining 
the coupling pointwise. Later, when we consider the induced martingale problem 
on M x TV \ {r = 0}, we will have to account for the behavior of the configuration 
as (x,y) = (x t ,y t ) varies. 

With the product metric on M x N, again using the fact that x and y can be 
viewed as elements of K 3 , and viewing the coordinate Zi clS cL function on M 3 that 
gives the ith coordinate, we have 

V(mxN) z i( x — y) = cos9d a — cos ip cos ipd a — sinipdb, 

(3) V (MxA r)Z 2 (a; - y) = dp + cos tp sin ipd a - cosipd b , 

and V( MxA r)Z 3 (2; - y) = sin6»<9 Q - sinpd a . 

Now we wish to see what the above computations mean for the evolution of r 
under some (to be determined) coupling of Brownian motions on M and N. Our 
orthonormal basis {d a , dp} for T X M uniquely determines normal coordinates (a, (3) 
in a neighborhood of x in M, and we similarly have normal coordinates (a, b) in a 
neighborhood of y in N. Further, (a, f3, a, b) gives product normal coordinates in 
a neighborhood of (x,y) in M x N. Since the Zi t t — Zi(x t — yt) are martingales, 
their instantaneous evolution is determined by the rate of change of their quadratic 
variations. The above expressions for the gradients mean that, at this instant with 
{xt,yt) = (x,y), we have 

d (zi.t) = (cos 2 9 + cos 2 ip cos 2 ip + sin 2 ip) dt — 2 cosQ [cos <p cos ip d (at, at) 
+ smipd(a t ,bt}] , 

d (z2,t) = (l + cos2 psin 2 ip + cos 2 ip) dt — 2 [cos ipd (j3 t , b t ) — cos ip sin ip d (/3 t , a t }] , 

and d (23.4) = (sin 2 9 + sin 2 p) dt — 2 sin sin d (at, at) ■ 

Here we have used that the marginals are Brownian motions, so that the only 
unknown quantities are the four cross-variations above, which relate the Brown- 
ian motions on M and N. It is these four cross-variations which we think of as 
determining the coupling at (x,y). 

The Zi were chosen so that z\(x — y) = z 2 (x — y) = and z 3 (x — y) = r. 
Thus, the semi-martingale decomposition of r = \J zf + z\ + z|, at this instant 
with (xt, yt) = (x, y), is easy to compute using the above equations for the quadratic 
variations of the z$. In particular, Ito's formula implies that, at this instant with 
(xt, yt) = [x, y), the bounded variation part is growing at rate (d (z\ t t) + d (z2,t)) /2r 
while the quadratic variation of the martingale part is growing at rate d (23, t). We 
introduce one more bit of notation. Let / and g be such that the martingale part 
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has quadratic variation given by the integral of / along paths and the bounded 
variation part is given by the integral of g/2rt along paths (plus ro). Then / and g 
are non-negative functions of the configuration (8, tp, tp) and the coupling, and they 
determine the evolution of r t . 

We return to discussing how the Brownian motions should be coupled at (x, y). 
We wish to consider couplings which make the cross- variations as large (in absolute 
value) as possible, since it is intuitively clear that such couplings give the most 
control over / and g. We are working within the framework of the martingale 
problem, but the type of couplings we are considering can perhaps be explained best 
from the point of view of stochastic differential equations. From this perspective, 
we are saying that the Brownian motion on N should be driven by the Brownian 
motion on M (at least at the instant under consideration), or vice versa, since the 
roles of the two surfaces are symmetric. At any rate, such couplings are, at a point, 
parametrized by isometries between T X M and T y N. In terms of our orthonormal 
bases, all such maps can be written in the form 

d a = A cos <rd a + A sin adp , 

db = — sinad a + cos cr<9/3, 

where a € [0, 2tt) and A is ±1. Note that we're simply relating the tangent planes 
by an element of 0(2). Our choice of orthonormal bases for the tangent planes 
determines coordinates for 0(2) as above, where A determines whether we're in the 
orientation-preserving or orientation-reversing component and a then parametrizes 
the relevant component, which is diffeomorphic to S 1 (it is a slight extension of the 
term "coordinates" to call a and A coordinates, but the meaning is clear and no 
harm is done). 

In terms of a and A, the four cross- variation terms relating the Brownian motions 
on M and N , at the instant when (xt, yt) = (x, y), are given by 

d (a t , a t ) = A cos a dt, d {fit, at) = A sin a dt, 

(4) 

d (at, bt) = — sine dt, and d (fit, &t) — cos a dt. 

Then, at the instant when (xt,yt) = { x ,y), the instantaneous evolution of r t is 
determined by the equations 

/ = sin 2 9 + sin 2 tp — 2 A sin 9 sin tp cos a and 

g = 2 + cos 2 9 + cos 2 tp — 2 cos a [A cos 9 cos tp cos tp + cos ip] 

+ 2 sin a [A cos tp sin tp + cos 6 sin tp] . 

We are now in a position to indicate what the criterion for our coupling to be 
"optimal." We wish to consider the r t process on new a time-scale, namely the 
time-scale in which its quadratic variation grows at rate 1. In this time-scale, the 
martingale part is just a Brownian motion, and the drift coefficient is given (at the 
instant for which (x t ,yt) = {x,y)) by ^f. Thus, the time-changed process will be 
dominated by a two-dimensional Bessel process if g < f, and the domination will be 
strict whenever this inequality is strict. Note that, unlike g/f, this inequality makes 
sense even if / = 0. That the two-dimensional Bessel process is the critical case will 
be seen in the following analysis (we mention it here for future reference). That 
the two-dimensional Bessel process is critical is obviously significant in our effort 
to make r hit zero as efficiently as possible, in light of the well-known relationship 
between the dimension of a Bessel process and its long-time behavior. 
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Motivated by this, we will choose our coupling so as to maximize 

/ — g = — 2 cos 2 9 — 2 cos 2 tp + 2 cos a [A cos 9 cos tp cos tp 

+ cos tp — A sin 9 sin </?] — 2 sin a [A cos </? sin tp + cos sin tp] . 

If we assume for a moment that A is fixed, then it is clear how to choose a in 
order to maximize this expression. In particular, we can view the two bracketed 
expressions as being the two components of a vector in the plane, in which case the 
two terms depending on er become the dot product of this vector with a unit vector 
making angle a with the positive horizontal axis. Thus we should choose a so that 
these vectors are parallel. Let a + be this optimal choice when A = 1 and er_ be 
this optimal choice when A = — 1. 

Writing explicit formulas for a + and er_ isn't important (although it could eas- 
ily be done in terms of inverse tangents); instead we can write directly that the 
maximum of / — g arising from the optimal choice of er + or er_ is 

- 2 cos 2 9 - 2 cos 2 tp + 2 {cos 2 tp (l + cos 2 9 cos 2 tp) + sin 2 tp (cos 2 9 + cos 2 tp) + 

1 /2 

sin 2 9 sin 2 tp — 2 cos 9 cos tp cos tp sin 9 sin tp + 2 A (cos 9 cos tp — cos -0 sin 9 sin </?) } , 

where we've used the fact that A 2 = 1. Further, it's now clear how to choose A; 
we want A to be equal to the sign of the expression it multiplies. Doing so shows 
that the maximum of / — g, that is, the value realized by the optimal coupling, is 

- 2 cos 2 9-2 cos 2 tp + 2 {cos 2 tp (l + cos 2 cos 2 + sin 2 tp (cos 2 6> + cos 2 tp) + 

1 /2 

sin 2 9 sin 2 tp — 2 cos cos <p cos i/' sin 9 sin <y9 + 2 |cos 9 cos (/? — cos tp sin sin tp\ } 

Let E_|_ be the region (in (M x N) \ {r = 0}) where cos cos </? — cos tp sin 6> sin tp is 
non-negative, and let £_ be the region where this expression is non-positive. Then 
E + is the set where the maximum of / — g is realized by a coupling coming from 
the orientation-preserving component of 0(2), and E_ is the analogous set for the 
orientation-reversing component of 0(2). Here the notions of orientation-preserving 
and orientation-reversing are defined relative to the standard choice of orthonormal 
bases for T X M and T y N given above (which we recall are not continuous when the 
configuration is at the boundary of its range). Further, let Eo be the set where 
cos 9 cos tp — cos tp sin 9 sin tp is zero; this is the set on which there are two possible 
choices for an optimal coupling, one from each component of 0{2). We will return 
to questions of how the choice of optimal coupling varies over (M x N) \ {r = 0} 
below. In particular, we will see in Lemma[3]that So is defined invariantly, that is, 
it does not depend on the particular identification of maps from T X M to T y N with 
0(2). 

Next, we wish to show that the optimal coupling is good enough for our purposes. 
In particular, we wish to show that the above expression for the maximum of / — g 
is non- negative for any values of the parameters, which will show that the time- 
changed rt process is dominated by a two-dimensional Bessel process (assuming 
that a coupling satisfying this pointwise specification exists, a question which we 
continue to postpone) . We also wish to determine the values of the parameters for 
which the expression is zero, since those will be the configurations where the pro- 
cess looks instantaneously like a two-dimensional Bessel process, rather than being 
strictly dominated by one. Observe that the non-negativity of / — g is equivalent 
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to the inequality 

cos 2 tp (l + cos 2 cos 2 tp) + sin 2 tp (cos 2 + cos 2 p) + sin 2 9 sin 2 p 
(6) — 2 cos 9 cos </? cos ^ sin 9 sin 93 + 2 |cos 6* cos 99 — cos tp sin sin tp\ 

> (cos 2 + COS 2 if) 2 , 

and positivity is equivalent to strict inequality. 

We begin by considering the case when tp = (this occurs when x — y and the 
normal vectors to T X M and T y N are coplanar, as vectors in M 3 ). Then the above 
simplifies to 

1 + cos 2 9 cos 2 tp + sin 2 9 sin 2 p — 2 cos cos 99 sin sin tp + 2 |cos(0 + 93) | 
> (cos 2 + cos 2 <p) 2 . 

After simplifying the left-hand side and taking the square root of both sides, we 
see that this is equivalent to 

|cos(0 + ip) \ + 1 > cos 2 9 + cos 2 ip. 

One then can show that this inequality holds, and that one has equality exactly 
when 9 — ip < 7r/4 or 9 + ip = ir/2. 

Recall that whenever either 9 or tp equals zero, we can assume that tp = 0. Thus 
we now consider the case when all three angles are positive. First we assume that 
9 + tp > 7r/2. We can rewrite inequality ((6]) as 

cos 2 9 + cos 2 <p - (cos 2 9 + cos 2 pf + (cos 2 tp + l) sin 2 9 sin 2 tp 

— 2 cos tp cos 9 cos tp sin 9 sin 99 + 2 |cos 9 cos 99 — cos tp sin sin 95 1 > 0. 

Now + ip > tt/2 implies that 

sin sin 99 > cos cos 99 and cos 2 + cos 2 ip < 1 . 

It follows that the sum of the first three terms is non-negative. Along with the fact 
that 

cos 2 tp + 1 > 2 cos tp for tp 6 (0, 7r] , 

it follows that the sum of the fourth and fifth terms is strictly positive. Since the 
last term is non-negative (it is an absolute value), we conclude that the inequality 
holds strictly. 

We continue to assume that all three angles are positive, and now we consider 
the case 9 + ip < tt/2. We can rewrite inequality ((6|) as 

2 cos cos tp (1 — (cos cos tp + sin sin p cos tp)) + cos 2 sin 2 

+ cos 2 p sin 2 p + (cos 2 tp + l) sin 2 sin 2 p — 2 sin sin p cos tp > 0. 

We know that this holds for 7^ = 0. We compute the derivative of the left-hand 
side with respect to tp as 
d 

— — l.h.s. = 2 sin tp sin sin 99 (1 + cos cos 99 — cos tp sin sin tp) . 

dtp 

This is positive as soon as tp is positive, and we conclude that the desired inequality 
holds strictly when all three angles are positive. 

This shows that, under the optimal choice of A and a, f — g is always non- 
negative. Further, / — g is positive except in the following configurations. (That 
these configurations are the only ones with / — g = is something we computed 
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above. The additional details we are about to provide in these cases follow from 
elementary trigonometry and the above equations.) When "0 = and 9 — ip < 7r/4, 
we have / = g = 0. When tp — and + tp = tt/2, there are two possible 
couplings giving the maximum value of / — g, one orientation- preserving and one 
orientation-reversing. For both choices of optimal coupling, / and g are equal and 
positive (except when 6 = ip = 7f/4), although their shared value is larger under the 
orientation-reversing coupling. Finally, the case = and 9 = ip = 7r/4 lies at the 
border between the above configurations. Here, there are again two possibilities for 
the optimal coupling. The orientation-preserving choice gives / = g = 0, while the 
orientation-reversing coupling gives f = g = 2. 

Recall that our goal is to produce a coupling for which the particles are encour- 
aged to couple in finite time. Toward this end, it is not so much the specific coupling 
that matters to us as much as its qualitative features, especially the domination by 
a time-changed two-dimensional Bessel process. In particular, the distance between 
the particles, which we called r, is a semi-martingale under any coupling, and this 
semi- martingale is characterized by the corresponding versions of / and g. We call 
any coupling with the feature that its corresponding versions of / and g satisfy all 
of the properties of / and g listed in the previous paragraph (that is, the domi- 
nation by a time-changed two-dimensional Bessel process is strict expect for the 
configurations given, for which strictness fails in the ways described) an adequate 
coupling. As this terminology suggests, the coupling we've already described is the 
optimal one from our point of view, but any adequate coupling is still good enough 
for our exploration of minimal surfaces. We will see why this matters in the next 
section. 

3.2. The martingale problem. The computations in the previous section de- 
scribe what the optimal coupling should be at a single point of (x, y) € Mx N\ {r = 
0}; more specifically, they specify the desired covariance structure at each point. 
The next task is to show that this gives a global description of the coupling. (Dealing 
with existence is a separate question. Here we just want to see that this pointwisc 
choice of a and A leads to a well-defined operator such that the corresponding 
martingale problem gives the coupling we're looking for.) 

As we have already indicated in the previous section, we think of a coupled Brow- 
nian motion as a process on the product space such that the M and N marginals 
are Brownian motions. More specifically, we can formulate this in the language 
of the martingale problem, and we begin by briefly recalling what the martingale 
problem is (for a more detailed explanation, consult the standard text [15j). Let L 
be a second-order operator on M x N . For simplicity, we consider the case when 
the solution is not allowed to explode (which will be the case for coupled Brownian 
motion on stochastically complete manifolds); extending this to allow for explosion 
is easily accomplished by stopping the process at the explosion time, as we did for 
Brownian motion at the beginning of Section 12.11 Then using the notation from 
the beginning of Section [2Tj a solution to the martingale problem corresponding to 
L, starting at (a;o,2/o)j is a probability measure P( Xo , yo ) on (C[0, oo),S) such that 



p (x ,yo) M°) = Oo,?/o)] = i 



and 
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for any smooth, compactly supported function h on M x N. Further, we note that 
the martingale problem is compatible with stopping times. For example, if we let £ 
be the first hitting time of {r = 0}, then we can consider solutions to the martingale 
problem up until £ by stopping the process at £. In this case, the operator L need 
not be defined at {r = 0}, and the class of test functions h can be restricted to 
those having compact support on (M x N) \ {r = 0}. 

Next, we review some basic facts about martingale problems that we will use 
throughout what follows. There are two basic situations for which a solution to the 
martingale problem is known to exist. If L is C 2 , then there is a unique solution to 
the corresponding martingale problem, at least until possible explosion (see Chapter 
5 of [H]). If L is locally bounded and uniformly elliptic, then there is a strong 
Markov family of solutions to the corresponding martingale problem, at least until 
possible explosion (see Exercises 7.3.2 and 12.4.3 of [15]). Further, both of these 
results can be localized (see Chapter 10 of [E]). Indeed, we have already taken 
advantage of this fact in transferring the above results to the manifold setting. 
Localization also means that a global solution can be constructed by patching 
together local solutions. 

As indicated, specifying a martingale problem corresponds to specifying a second- 
order operator, the diffusion operator. At any point (a;, y) G (M x N) \{r = 0}, we 
have product normal coordinates (a, (3, a, b) as described in the previous section. 
The operator at (x, y) in the (a, /?, a, b) coordinates is given by a symmetric, non- 
negative definite matrix such that the operator applied to a function / 
is given by \ a hj didjf. Let 



O 



A cos a A sin a 
— sin a cos a 



let O t be its transpose, and let I be the 2x2 identity matrix. Then we have 
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is symmetric and non-negative definite, and also that 



It's easy to see that [a^j]!^^) 
it has rank two. 

Carrying out this procedure at every point shows that any pointwise choice of 
A and a does in fact give rise to a well-defined operator. Let L + be the operator 
determined by the optimal choice of coupling from the orientation-preserving com- 
ponent of 0(2), that is, the operator given by (A, a) = (l,cr+), and let L_ be the 
analogous operator arising form the orientation-reversing component of 0(2), given 
by (A, a) = (— 1, <r_). Then define L to be equal to L + on the interior S + and L_ 
on the interior of £_. This determines L uniquely except on Soj where there are 
two possibilities. We will have more to say about S later, but for now we will 
leave open whether we choose the orientation-preserving or orientation-reversing 
possibility. 

Now suppose we have a solution to martingale problem corresponding to L. We 
want to compute the semi-martingale decomposition of any function h on M x N\ 
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{r = 0} composed with the solution process. The bounded variation process is just 
the integral of Lh along paths, while the quadratic variation of the martingale part 
is given by the integral of Vh ■ [aij] ■ V/i along paths. Alternatively, the growth 
of quadratic variation of the martingale part at each point (or more generally, the 
joint variation coming from two different functions) is determined by a bilinear 
form •). In the same normal coordinates around (x,y), we can write a vector 
v e T (x , y) {M x N) as 

v = v a d a + vpdp + v a d a + v b d b . 

Then T(v, v) is determined by its expression in these coordinates. This is most easily 
done by using the inner product structure on the tangent space. Also, we divide 
the formula for T into two cases, the case where (A, a) = (1, a + ) corresponding to 
the diffusion operator at our point being L+ and the case where (A, a) = (— 1, <r_) 
corresponding to L_. We denote the corresponding bilinear forms by T + and T_. 
Then we have 

r + (t> , v) = (v, d a + cos u+d a — sin <T + dt) 2 + (v, dp + sin a + d a + cos a+db) 
r_(u, v) = (v, d a — coscr_<9 a — sincr_9h) 2 + (v, dp — sin cr_9 a + cos(7_<9f,) 2 

Note that T at a point depends only on first-order information at that point. 

We now verify that a solution to the martingale problem corresponding to L gives 
the coupling that it's supposed to. First of all, any solution has marginals which are 
Brownian motions. This follows from projecting [ajj] \( x ,y) down to M (which means 
taking the upper-right 2x2 sub-matrix), which shows that the marginal diffusion 
operator is expressed, in normal coordinates, as the 2x2 identity matrix. Thus, 
the marginal diffusion operator is one-half the Laplacian and the marginal process 
is Brownian motion. The same holds for the marginal process on N. Second, we 
can check that any solution induces a process r t — r(x t ,yt) with the desired semi- 
martingale decomposition. Again, choose (a, (3, a, b) normal coordinates around a 
point (x, y). Since r doesn't depend on a particular choice of Euclidean coordinates, 
we can choose z\, z 2 and z 3 as at the beginning of the previous section. The 
coordinate process will be martingales, and we see that [ctij^^j,) is chosen so that 
their covariance structure is as desired. Once we know that the coordinate processes 
have the desired semi-martingale decompositions, the decomposition for r t follows. 

Ideally, we would produce a solution to the martingale problem for L, and use 
that in our study of minimal surfaces. Unfortunately, L is everywhere degenerate 
and discontinuous at So- Thus, the existence of a solution is not guaranteed by 
standard theorems. More to the point, we don't have sufficient control of the 
configuration angles near So to prove existence for arbitrary minimal surfaces. To 
get around this difficulty, we will modify our operator. The idea is that anywhere 
the coupling is strictly dominated by a time-changed two-dimensional Bessel process 
we have room to adjust it while remaining dominated by such a Bessel process. In 
particular, this will allow us to produce an operator that is easier to deal with 
but such that any solution to the corresponding martingale problem will still be 
adequate (although no longer optimal, in the sense that a solution corresponding 
to L would be). 

To make this precise, let S e C So be the set of points where tp — (recall our 
convention that anywhere we can choose ip to be zero, we do) and 9 + tp = ir/2. 
Thus S e is precisely the set of points in So where the coupling looks instantaneously 
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like a time- changed two-dimensional Bessel process, and where we have no room 
to perturb the operator while keeping the coupling adequate. It follows that, for 
any (x, y) G So \ S e , the value of / — g (recall that / is the time-derivative of the 
quadratic variation and g/2r is the time-derivative of the bounded variation part of 
Tt under any solution to the martingale problem) achieved by the optimal coupling 
is positive. Further, this value depends continuously on the cross-variations (which 
determine how the Brownian motions are coupled at a point), so a sufficiently small 
change in the operator will decrease f — g (which we now view as depending on the 
operator as well) but maintain its positivity. We can't make the operator continuous 
with an arbitrarily small perturbation, but we can make it elliptic. In particular, 
we can replace the cross-variations prescribed by Equation Q (where we choose a 
and A to correspond to the optimal coupling) with the cross-variations prescribed 

by 

d (at, at) = (1 — e)A cos a dt, d (Pt, o.t) — (1 ~ i)A sin a dt, 

d (at, bt) = (1 — e) (— sine) dt, and d({3t, bt) = (1 — e) cosadt. 

In terms of the corresponding operator characterizing the martingale problem, in 
(a, (3, a, b) normal coordinates at a point, it is given by the following matrix, 

" i VT~?o T ' 
VT~^o i 

where / and O are as in Equation ([7]). Heuristically, from the stochastic differential 
equation point of view, this means that Brownian motion on N is only partially 
being driven by Brownian motion on M, and "the rest of" the Brownian motion 
on N is being driven by an independent source, where the ratio between these two 
driving sources is governed by e. In particular, e = gives us the same coupling as 
before, while e = 1 means that the Brownian motions on M and N are independent. 
For any e € [0, 1], the resulting marginals are still Brownian motions on M and N. 
Also, the value of / — g achieved by the coupling is smooth in e. Finally, note that 
if e < 1, then the corresponding operator will be elliptic. 

It follows that we can choose i G [0, 1/2) to depend smoothly on (x, y) £ M x 
N \ {r = 0} in such away that e = e(x, y) is zero outside of some neighborhood 
of So, zero on S e , positive on So \ S e , and so that the corresponding coupling is 
adequate (which also requires e to be zero on the set of points where "0 — and 
= <p < 7r/4). Call the resulting operator L. Of course, L depends on our precise 
choice of a function e. We will simply assume that we've chosen some e satisfying 
the criteria just mentioned, and any will do. It follows from our construction that, 
for any point (x,y) £ So \ S e , L is uniformly elliptic in some neighborhood of 
(x,y), and also that L has the same smoothness properties as L everywhere. The 
advantage of using L instead of L is that now we really only need to worry about 
the behavior of the process near the smaller set of discontinuities S e , which we will 
find much more manageable. 

It is the martingale problem for L for which we will show there exists a solution. 
As mentioned, this will be sufficient for our purposes, since the resulting coupling 
will be adequate. Finally, we comment that our study of L remains relevant, since 
we will primarily approach L as a perturbed version of L. This is especially true 
at S e , since the two operators are equal there and thus we will continue to use our 
expressions for T. 
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4. Existence of an adequate coupling 

4.1. Preliminary results. The fact that we have defined L, and hence L, in a 
different system of coordinates at each point obscures the questions of how smooth 
L is. This isn't too hard to get around near points where the configuration (6, tp, -0) 
is in the interior of its range, since one can see that all of the coordinates (both on 
R 3 and on M x N) introduced in the previous section will be smooth near such a 
point. However, these canonical coordinate systems can be discontinuous when the 
configuration is at the boundary if its range. This means, for example, that the 
identification of 0(2) with isometries between the tangent spaces can also change 
discontinuously. 

One could try to introduce other, better behaved coordinates near such points. 
However, this turns out not to be necessary. The following lemma clarifies the 
behavior of L. 

Lemma 3. Using the above notation, Eo is locally the zero level-set of a smooth 
function, and L is smooth at any point not in Eo . 

Proof: As already mentioned, L is smooth at a point if and only if L is. Thus 
we will prove the lemma for L, since we have more explicit formulas for L. 

We begin by considering the expression of an operator in local, smooth coordi- 
nates. In particular, choose smooth product coordinates (x, y) on some product 
neighborhood S = Sm x Sjy of a point (xo,yo) S (M x N) \ {r — 0}, and smooth 
orthonormal frames for both M and N on Sm and Sn. As above, this choice of 
orthonormal frames on M and N determines an identification of the isometries 
between tangent planes and 0(2). Suppose the operator under consideration is 
determined by a choice of element of 0(2) at each point (x,y) 6 S, in the fashion 
described above. Then if the the map from S to 0(2) is smooth, so is the corre- 
sponding diffusion operator. This is just a consequence of the change of variables 
formula and the fact that all of the functions involved are smooth. 

Thus, in order to show that L is smooth at such a point (xo,yo), it is enough 
to show that, with respect to coordinates and frames as above, the map from S 
into 0(2) which determines L is smooth. Suppose that, with respect to the induced 
identification of the isometries between tangent planes and 0(2), L is determined by 
elements of the orientation-preserving component of 0(2) on S. Let s S [0, 2ir) be 
the natural coordinate on the orientation-preserving component. Then Equation (J5]) 
implies that 

(/ - 9)(x,y,s) = a(x,y) ■ coss + (3(x,y) ■ sins + j(x,y) 

for some functions a, /3, and 7. Here / and g are the functions determining the semi- 
martingale decomposition of r t , under the coupling corresponding to any choice of 
s (assuming such a coupling exists). In particular, this equation gives f — g for any 
choice of s; the value of / — g for L is obtained by letting s be the "optimal choice" , 
as determined above, at each (x,y). 

Note that the images of x and y in R 3 and the corresponding tangent planes vary 
smoothly, as do the projections of the tangent planes onto the direction of x — y 
(since r ^ 0) and its orthogonal complement. Since / and g depend smoothly on 
these projections and on the coupling, it follows that / — g is a smooth function of 
(x, y, s). Further, Equation ([5]) makes it clear that j(x,y) is always non-positive, 
and zero only when (x, y) is such that 9 = if = in the standard configuration. 
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Since we showed in the previous section that the maximum of / — g is always 
non-negative, and since this maximum is equal to i/a 2 + f3 2 + 7 on S (recall that 
we're assuming that the maximum is achieved by a coupling in the orientation- 
preserving component on S), it follows that \J a 2 + (3 2 is positive except possibly 
where 6 = ip = 0. Where 9 = ip = 0, this can be checked by hand, and thus 
\J o? + (3 2 is positive everywhere in S. 

Next, observe that because / — g is smooth in x and y for every fixed s £ [0, 2ir), 
we can see that a, /?, and 7 are smooth functions of x and y. This implies that 
\J a 2 + (i 2 is positive and smooth on S, and thus, possibly by shrinking S, we can 
assume that \J a 2 + j3 2 is smooth and uniformly positive on S. We know that, for 
given x and y, the choice of s that maximizes / — g, which we will denote s + (x, y), 
is the angular component of the polar representation of (a(x, y), /3(x, y)) £ M 2 . 
Since the components of this vector are smooth and and the length of the vector is 
uniformly positive on S, it follows that s + is smooth on S. Since s+ determines L 
on S, it follows that L is smooth on S. Obviously, an analogous argument applies if 
we assume that L is realized by an element of the orientation reversing component 
of 0(2) on S. 

Now suppose that we choose a point (#0,2/0) £ So, and we again choose smooth 
coordinates and smooth orthonormal frames on a neighborhood S as above. At 
{%o,yo), there are two possible choices of optimal couplings, one from each con- 
nected component of 0(2). Let L + be the operator on S obtained from the optimal 
choice of coupling in the orientation-preserving component, and let L- be the anal- 
ogous operator for the orientation-reversing component (these need not the be the 
same as L + and L_, as these were defined for a particular choice of orthonormal 
frames) . Since the value of / — g achieved by each of these two couplings is positive 
at (x , yo), the above argument implies that, after possibly shrinking S, we can as- 
sume that both L + and L_ are smooth on S. Further, the values of / — g realized 
by L + and L_ are also smooth on S. Since So is the set where these two values 
are equal, it follows that So is the zero-level set of a smooth function on S. 

Because So is locally the zero level-set of a smooth function, it is a closed set. 
Thus, if we choose a point (xo,yo) m the complement of So, we can take our 
neighborhood S to be disjoint from S . Then the optimal coupling belongs to 
the same connected component of 0(2) throughout S (that the coupling can only 
"switch" components of 0(2), with respect to smooth frames, at S follows from 
the smoothness of / — g and the fact that S is the only set where there are two 
possible choices for the optimal coupling), and the above argument shows that L is 
smooth on S. □ 

This proof also gives a description of the discontinuities of L and L. We know 
that L can only be discontinuous at So- In a neighborhood of a point in So, both 
L+ and L- (using the notation from the proof, and assuming that we have chosen 
some smooth orthonormal frames on M and N) are smooth. The discontinuity in 
L arises from the fact that we "switch" from having L given by L + to having L 
given by L_. In particular, if we let ft- be a smooth function such that S is locally 
the zero level-set of h, and if the gradient of h is non-zero at (x ,y ) £ S , then S 
is locally a hypersurface, and L is given by L + or _L_ depending on which side of 
the hypersurface we're on, at least locally. An analogous description applies to L, 
with L + and L- replaced by their perturbed versions. 
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One consequence of this lemma is that if we start our coupled Brownian mo- 
tion, corresponding to L, in the complement of So smoothness implies that we 
have a unique solution at least until the first time the process hits So- Further, 
the ellipticity of L gives us what we need for existence on So \ S e , which is the 
reason for introducing L. Near S e , we need the following lemma. Recall that the 
signed distance to a smooth hypersurface is the smooth function, defined in some 
neighborhood of the hypersurface, the absolute value of which is the distance to 
the hypersurface. 

Lemma 4. Let (x,y) be a point of S e . Then there is a neighborhood S of (x,y) 
such that one of the following holds: 

(1) So H S is a smooth hypersurface, and if vq is the gradient of the signed 
distance to So, then either T + (vq,vq) or T-(vq,vo) is positive at (x,y). 

(2) EenS* is contained in some smooth hypersurface H , and if vh is the gradient 
of the signed distance to H, then both T + (vh ,vh) and T—(vh,Vh) are 
positive at (x, y). 

Proof: We begin by introducing some notation. Wc let 

d a = d a + A cos crda — sin adb and dp = dp + A sin ad a + cos adb, 

so that T±(v, v) = (v, d a ) 2 + (v, dp) 2 for the appropriate choice of A and a. 

We need to to look at the first order derivatives of 9, tp, and ip with respect to d a 
and dp when ip = and 9 + tp = tt/2. Fortunately, the derivatives of (x — y)/\x — y\ 
are particularly simple in this case, as we see from Equation ([3]) and the fact that 
both <7_|_ and cr_ are zero when tp = 0. As for the normal vectors m and n, their 
derivatives are constrained only by the fact that the Gauss map is anti-conformal. 
Thus, we can describe the derivatives of m by k\ > and s\ G [0, 2ir) in the 
following way. We know that d a and dp form an orthonormal basis for T X M and 
thus also for T m E> 2 . Then the general anti-conformal map between the two can be 
written as 

d a {m) = ki (cos s\d a + sin s\dp) and dp{m) = k\ (sin s\d a — cos s\dp) . 

An analogous description of the derivatives of n can be given in terms of k% > 
and S2 € [0, 2n) with respect to d a and <9&. 

Because the roles of M and N are symmetric, we can assume that p > 9. 
Recall that So is the zero level-set of cos 9 cos p — cos i\) sin 9 sin tp, at least when 
the configuration is in the interior of its range. Next, note that this continues to 
hold if we allow ip to take negative values. Thus, if we don't insist on using only 
configurations in the canonical range mentioned above, all three angles are smooth 
functions near (x, y), if we assume that 9 > 0, and So is the zero level-set of a 
smooth function. So we first consider the case where 6 > 0. In light of the above 
discussion, a few simple computations show that under these conditions, that is 
whenever ip — 0, < 9 < 7r/4, and tp = ir/2 — 9, we have 

2 

d a (9 + tp) = — (cos 9 — A sin 9) — k\ cos s\ — Ak2 cos S2 ; 
r 

dp (9 + tp) = —k\ sin s\ — &2 sin S2; 

I A 1 1 

d a ip = --kisinsi + -k 2 sins2; and dpip — -fci cos si - — -k 2 coss 2 - 
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We begin by determining when the first possibility in the lemma holds. It's easy 
to see that vo at (x, y) will be a non-zero multiple of the gradient of 8 + tp. As long 
as d a (8 + tp) for at least one choice of A or dg (8 + tp) is not zero, either r + (^o, v ) 
or r_(t>o,Vo) will be positive at (x,y). We conclude that the first condition holds 
unless all three of the following equations are satisfied (at (x,y)): 

2 2 
— ki sinsi = &2 sins2, — cos# = k\ cossi, and — sva.6 = — cos S2- 
r r 

To complete the proof, for the 8 > case, we need to show that the second pos- 
sibility holds whenever all three of the equations are satisfied. We know that S e 
is contained in the zero level-set of tp, at least near (x,y). Note that, if the above 
three equations are satisfied, then ki cos si > and k 2 cos s 2 < 0. This implies that 
<9/3-0 > 0, for both choices of A, since it doesn't depend on A. Thus we can let H 
be the zero level-set of tp, and the second possibility holds. 

We now consider a point (x, y) € S e where 8 = 0. Here, tp is not continuous, and 
8 (which, we recall, is the distance in S 2 between m and (x — y)/\x — y\) has the 
usual non-differentiability of a distance function at its zero level-set. First, observe 
that "0 = requires that to, n, and (x — y)/\x — y\ all lie on the same geodesic in S 2 . 
If we let 7 be the great circle through (x — y)/\x — y\ and n, then 7 varies smoothly 
near (x,y). Further, if we let h be the signed distance between to and 7, then h is 
smooth near (x, y) as well. It follows that S e is contained in the zero level-set of h, 
near (x, y). Because of this, it is natural to ask what conditions must be satisfied in 
order for the second possibility in the lemma to hold. To first order, [x — y)/\x — y\ 
will only move along 7. It follows that if ki 7^ 0, either d a h, for both values of A, 
or dfih will be non-zero. In this case, possibility two of the lemma holds, with H 
the zero level-set of h. 

If fci = 0, then h is zero to first order. In other words, to, n, and (x — y)/\x — y\ 
remain colinear (in the sense of lying on the same great circle) to first order. This 
suggests the following approach. If we allow ip to take values greater than ir/2 and 
8 to take negative values (this is well-defined if we restrict 8 to the great circle), 
then these two angles determine the configuration, up to first order. It's easy to 
check that the characterization of So as points with 8 + tp = it/2 (when h = 0) 
extends to these values of tp and 8. Further, both angles are differentiable and our 
earlier formulas specialize to 

2 

d a (8 + tp) = Ak 2 cos s 2 and da (8 + tp) = —k 2 sin s 2 . 

r 

It is immediate that these cannot both be zero for both values of A. Since we know 
a priori, by Lemma [3l that So is the zero level-set of a smooth function, it follows 
that the first possibility in the lemma holds. □ 

The next two lemmas concern the case of an operator on M. d which, in standard 
coordinates (zi, . . . , z£), can be written in terms of a (measurable) locally bounded 
function a taking values in the set of symmetric, non-negative definite matrices and 
a (measurable) locally bounded function b taking values in M. d as 

d d 

(8) L = - + E 

where the a^j and the bi are the obvious components of a and b. Further, we choose 
C > so that I and \bi \ are all less than C on Br = {z : z\ + ■ ■ ■ + z n < R}. Let 
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P be a measurable, strong Markov family of solutions to the martingale problem 
associated to L, with P x denoting the solution started at x (we assume that such 
a family of solutions exists). 

Lemma 5. Let L and P x be as above, and suppose further that a is uniformly 
elliptic on Br (so that all of its eigenvalues are bounded below by some c > 0). Then 
the expected occupation time under P x from time to T has a density Gx{x,y) on 
Br/ 2 , an d this density obeys the estimate 

sup \\Gt{x,-)\\l*(b r/2 ) < A 
xeR d 

where A and q are positive numbers, with q> d/(d— 1), that depend only on d, R, 
T , c, and C . 

Proof: Because a and b are locally bounded, a standard localization argument 
implies that it is enough to prove the lemma when the process is stopped at the 
first exit time of Br. 

Let o~ i be the first exit time of Br (which is zero if x ^ Br); we denote the 
corresponding density by Gtaui (^il/)- An application of Girsanov's theorem (see 
section 6.4 of [TS]) shows that it's enough to consider the case when 6 = 0. Then 
the estimate for GxAaxix^), as well as the fact that such a density exists, follows 
directly from Corollary 2.4 of [3]. □ 

For the next lemma, we assume that the rectangle (0, 1) x (—1, is contained 

in Br. Because we can rescale the coordinates, this is no loss of generality. 

Lemma 6. Let L and P x be as above. Suppose that a± t i is bounded from below by a 
positive constant c on the rectangle (0, 1) x (—1, 1) . Then the expected occupation 
time of (0, 6) x (—1/2, l/2) d_1 from time to T under P. x goes to zero with 6, at a 
rate which depends only on d, R, T , c, and C . 

Proof: Again, the local boundedness of a and b means that it is sufficient to 
prove the lemma when the process is stopped at a±, the first exit from Br. 

Let £(ir) be a smooth, non-negative, even function such that £(x) < \x\, \£,'( x )\ < 
1, > 0, and £"(x) = 1 on (-1/2,1/2). Then let £ s (x) = 6 2 \{x/6). Because 

£,&{z\ — 6/2) is bounded on Br, its expectation at time T A o\ is bounded from 
above by kb~ for some constant k depending only on d and R (assuming 5 < 1). On 
the other hand, Ito's rule implies that the expectation of £a( z i —6/2) at time TAeri 
is at least c/2 times the expected occupation time of (0, 6) x (—1/2, l/2) d ~ 1 , minus 
(T A a{)C6. It follows that the expected occupation time of (0, 5) x (-1/2, l/2) d_1 
is less than 

HCfTAdi) . 
~2 ^ 

Thus we've proved that the lemma holds for the process stopped at o~\. □ 

4.2. Proof of existence. We are now in a position to prove the existence of an 
adequate coupling, namely, the existence of a solution to the martingale problem 
corresponding to any choice of operator L as described above. 

Theorem 7. Let M and N be any stochastically complete minimal surfaces. For 
any points xq G M and t/o£JV with r(xo, yo) ^ 0, there exists an adequate coupling 
of Brownian motions started at xq and yo defined until the first time r t = r(xt,yt) 
hits zero, in particular, the coupling corresponding to any choice of operator L as 
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described above. Further, given any such L, the corresponding coupling is unigue 
until the first time it hits So . 

Proof: Existence of a solution to the martingale problem for L starting from 
(xo,j/o) is not guaranteed by standard results, so we proceed by an approximation 
argument. Consider the family of operators L^' — L+A/j where A is the Laplacian 
on the product manifold M x N. In order to deal with stopping the process at 
{r = 0}, we will need a second level of approximation. For all small, positive e, let 
7y £ be a smooth function from M x N into [0, 1] that is 1 on {r < e/2} and on 
{r > e}. Then we let 



We note that L^' fc ) is defined on all of M x N, unlike L which is not defined on 



In order to apply standard theorems in martingale theory, it will be helpful to 
introduce global coordinates and thus work on R 4 . As usual, we can pass to the 
universal covers of M and N, and so we assume that they are simply connected. Be- 
cause they have non-positive curvature, normal coordinates around any point give 
global coordinates, and map each surface diffeomorphically to R 2 . This extends to 
the product manifold in the obvious way. In these global coordinates, each L^' k ^ 
can be written in the form shown in Equation 1(5)) with locally bounded, locally uni- 
formly elliptic coefficients. It follows that for each L^ ,k ' we can find a measurable, 
strong Markov family of solutions to the corresponding martingale problem (see 
the beginning of Section [3. 21 for the definition of the martingale problem) which we 
denote P,^,. (A priori, these solutions are defined only up to explosion, but we 
will see in a moment that they never explode). In order to simplify the notation, 
we will assume some starting point (a5o,j/o) nas been chosen and simply denote 
the corresponding measures by pU> k > whenever there is no possibility of confusion. 
Consider the marginal processes on M and N. In each case, the marginal distri- 
bution of ptw solves the martingale problem for (j + 2)/2j times the Laplacian. 
This is just time-changed (by a constant factor) Brownian motion on a stochasti- 
cally complete manifold. It follows that the PU>*0 processes never explode, since 
M and N are both stochastically complete and a process on a product manifold 
blows up if and only if one of the marginals blows up. 

Next, we need to show that any sequence of {PU> fe )} has a weakly convergent 
subsequence, that is, that this family of measures is pre-compact. All of our process 
start from the same point, so Theorem 1.3.1 of [T5] asserts that {p(-?> fe )} is pre- 
compact if and only if, for every p > and T < oo, we have 



Recall that the marginals on M and N are time-changed (by a constant factor) 
Brownian motions, and thus we see that the family of marginals on M and the 
family of marginals on N both possess the property described by Equation ([9]) . For 
any path on the product space, the increment \u>{t) — w(s)| R4 is bounded by the 
sum of the increments of the projections onto M and N, by the triangle inequality. 




{r = 0}. 



(9) 



lim infP (i ' fc) 

6\0 j,k 



sup \to(t) 

0<s<t<T 
t-s<6 



w(s)| R4 < P 



= 1. 
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It follows that the family {p(^ k )} possesses the property described by Equation (|9j). 
and thus any sequence of {p(-?- fc )} has a weakly convergent subsequence. 

Consider any sequence (j(l),k(l)) such that — > oo and k(l) > fco for some 
positive integer kg. The corresponding sequence of measures has a convergent sub- 
sequence, so after possibly re-indexing our sequence, we can assert that pUwMO) 
converges to a limit we call P k ° (obviously, the limit depends in general on the 
sequence k(l)) and not just on ko, but we'll see that this notation will be suf- 
ficient for our purposes). Further, let Ce be the first hitting time of the set {r < e}; 
in particular, Co = C which we have previously defined as the first hitting time of 
{r = 0}. 

We wish to prove that P ka is a solution to the martingale problem corresponding 
to L until Ci/feo- It's easy to see that P k ° [o>(0) = (xo,yo)] = 1 almost surely, and 
so it remains to prove that P k ° has the desired martingale property. For this, it is 
enough to show that 



E 



F h(u(t)) — h(u>(s)) — I Lh(u(u))du 



= 



for < s < t, any bounded, continuous, £? s -measurable function F : C[0, oo) — * M, 
and any smooth h compactly supported on (M x N) \ {r < l/ko}, where the 
expectation is with respect to P k ° . This in turn will follow if we show that 



(10) 



- I) h(Lu{u)) 



du 



F / Lh(uj(u))du 



- E 


H 







Lh(u>(u)) du 



goes to zero as I goes to infinity, where is expectation with respect to pW)-' c ( / )) . 
The first term goes to zero because F is bounded and [L^^ — L) h converges 
to zero uniformly, using the fact that h is smooth and compactly supported on 
(M x N) \ {r < 1/fco} and L {j{l)) converges to L on (M x N) \ {r < l/k }. 

For the second term, we note that, by a partition of unity argument, it is suf- 
ficient to show that for any point (x, y) G (M x N) \ {r < 1/fco} there is an open 
neighborhood S of the point such that the second term goes to zero for h supported 
on S (and thus S should be taken to be disjoint from {r < 1/fco}). For the purposes 
of such an argument, we see that there are three types of points. First, suppose 
that (x,y) is contained in the complement of Eo. Then we can choose S also to 
be contained in this interior. If h is smooth and supported on S, then so is Lh, 
using the fact that L is smooth on the complement of So- Then the second term 
of Equation (flO|) goes to zero by the definition of weak convergence (recall that F 
is continuous). 

Next, suppose that (x,y) £ Eo\E e . Then L is uniformly elliptic and bounded on 
any sufficiently small neighborhood of (x,y). Moreover, we can choose coordinates 
centered at (x,y) so that an d p(o( l )M 1 )) satisfy the assumptions of LemmaE] 

for some choice of constants R, c, and C independent of n, and with d = 4 and 
T > t. Also, we can choose S, our neighborhood of {x,y), to be contained in B R / 2 
(in these coordinates) and disjoint from {r < 1/fco}- Let q > 4/3 be the constant 
from Lemma [5l and let q' be its Holder conjugate. Then we can find a sequence of 
continuous functions £ m supported on B R / 2 which approximate Lh in L q (B R / 2 ). 
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By weak convergence, we know that 



F / € m (u>(u))du 



E 



as I — * oo. Further, Lemma [5] and the Holder inequality imply that 



E (0 



E (0 



F / Lh(oj(u))du 



as to — > oo, uniformly in Z. Because this convergence is uniform in I, combining 
these two equations shows that the second term of Equation (flO|) goes to zero as 
desired. 

The final type of point we need to consider is (x,y) £ S e . This divides further 
into two cases, depending on which of the possibilities in Lemma |4] holds. Suppose 
that the second possibility in Lemma 2] holds. Let T be the bilinear form that gives 
the cross-variation of vector fields under L. We know that the operators corre- 
sponding to the optimal orientation-preserving and reversing couplings are smooth, 
and that L agrees with L at (x,y). It follows that T(vh,vh) is bounded below by 
a positive constant on an open neighborhood S of (x,y); we also assume that S' 
is disjoint from {r < l/ko}. Thus, we can find coordinates (zi, . . . , Z&) centered at 
(x, y) such that Lv'(O) and pu(0> fc (0) satisfy the assumptions of Lemma|6]on both 
sides of H for some choice of constants R, c, and C independent of I, with d = 4 
and T > t; and where these coordinates are such that 

(-1, l) 4 C S' C Br 

and z\ restricted to S' is the signed distance to H. In particular, Lemma [6] implies 
that the occupation time of (-S, 5) x (—1/2, 1/2) goes to zero in 8 at a rate that 
can be taken to be independent of I. 

We can assume that the support of h is contained in S — (— 1/2, 1/2) . For any 
S 6 (0, 1/2), we can find a mollified version of Lh, which we denote £g, with the 
properties that £5 is supported on S, £5 is continuous in a <5/2-neighborhood of S e , 
the £5 are bounded uniformly in 5, Lh and £5 are equal outside of a (5-neighborhood 
of S e , and the discontinuities of £5 are contained in an open set where L is uniformly 
elliptic (here we mean uniformly in space, not in 6). We claim that, for any 6, 



(11) 



&(a;(u)) du 



E 



F / ft(w(u))du 



as Z — * 00. To sec this, note that a bump function argument shows that £5 can 
be written as the sum of a bounded function supported on a subset of S where 
L is uniformly elliptic and a continuous function supported on S. The argument 
given above for Lh supported on a set where L is uniformly elliptic shows that 
we have the desired convergence for the first term in the decomposition of £5, and 
weak convergence applies directly to the second term in the decomposition. The 
claim follows. Next, note that the difference in the expectation of £5 and Lh is 
bounded by a constant that does not depend on I times the occupation time of 
(—6,5) x (—1/2, 1/2), which we've already seen goes to zero with 5, uniformly in /. 
Thus we have that 



E (0 



F / £ s (uj(u))du 



E (0 



F / Lh(cu(u))du 
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as 6 \ 0, uniformly in I. Because this convergence is uniform in I, combining this 
with Equation (fTTj) shows that the second term of Equation (fT0|) goes to zero as 
desired. 

Now suppose the first possibility in Lemma [4] holds. The argument is similar 
to the previous case. Without loss of generality, we can assume that T + (vo,Vq) 
is positive at (x,y), since the the roles of T + and L_ are symmetric. It follows 
that T(vo,vq) is uniformly positive "on one side" of So in 5", where S' is some 
open neighborhood of (x,y), disjoint from {r < l/ko}. Then S' \ So is naturally 
divided into two disjoint, connected open sets, say S[ and S' 2 , and T(vq,vq) is 
bounded below by a positive constant on one of them, which we can assume is S[ . 
Then we can find coordinates (z\,... , Z4) centered at (x,y) such that L^w) and 
ptiwM 1 )) satisfy the assumptions of Lemma |6] for some choice of constants R, c, 
and C independent of I, with d = 4 and T > t; and where these coordinates are 
such that 

(0,l)x(-l,l) 3 cS'c5 fl 

and Z\ restricted to S' is a constant (non-zero) multiple of the signed distance to 
S , with positive z\ corresponding to S[ (this implies that S nS" is {z\ = 0}nS"). 
In particular, we note that, in the notation of Lemma [6l this implies that ai t i is 
bounded below by a positive constant on = {z\ > 0} n S' . 

We can assume that the support of h is contained in S = (—1/2, 1/2) 4 . In 
contrast to the previous cases, here the definition of L on S e matters. In particular, 
we assume that L|s is chosen to be the limit when approached from within S' 2 (this 
is equivalent to having it agree with the optimal orientation-preserving coupling on 
So if L corresponds to the orientation-preserving coupling on S' 2 , and similarly for 
the orientation-reversing case). We will say more about this assumption below. 
This implies that, for each 5 £ (0, 1/2), we can find a mollified version of Lh, which 
we again denote with the properties that £5 is continuous, the £5 are bounded 
uniformly in S, and Lh and £5 are equal outside of (0,6) x (—1/2, 1/2) 3 . Weak 
convergence means that 



&(a;(u)) du 



F / &(Lu(u))du 



as I — > 00. In addition, Lemma [S] and our choice of £5 imply that 



F / £ 5 (u(u))du 



E (0 



F / Lh(u)(u))du 



as 6 \ 0, uniformly in /. Because this convergence is uniform in I, combining these 
two equations shows that the second term of Equation (flQ|) goes to zero as desired. 

The only aspect of the argument in the preceding paragraph that needs comment 
is the possibility of globally defining L on S e . However, comparing the arguments 
in the cases when the first or second possibility of Lemma |4] holds, we see that the 
choice of L on S e only matters near points (x, y) G S e where the second possibility 
does not hold for any H and where either L+^q,^) or T_(vq,v ) is zero at (x,y) 
(which, of course, must be true if the second possibility doesn't hold, since otherwise 
we could just take H to be So). Near any such point (x,y), we know that there 
exists a neighborhood S of (x, y) such that S n S is a smooth hypersurface. If we 
choose smooth orthonormal frames for M and N on S, then the set of points where 
L must correspond to the orientation-preserving coupling and the set of points 
where L must correspond to the orientation-reversing coupling are closed, disjoint 
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sets. It follows that we can make a global choice of L (on (M x N) \{r = 0}) 
such that L|s is what it must be for the above argument to work at all points 
(x, y) € S e where only the first possibility of Lemma [3] holds. From now on, we will 
assume that L has been defined on Y.q in a way that satisfies the above description. 
As this is all that is needed, we've proved that P k ° is a solution to the martingale 
problem corresponding to L starting at (xo,yo) until Ci/fe - 

To continue, note that the limit of a convergent sequence pC?(0»*(0) depends 
only on the tail of the sequence. Thus we've actually shown that if — > oo and 
liminf k(l) > fco> then the limit of a convergent subsequence is a solution to the 
martingale problem corresponding to L until £fc . Choose (j(l),k(l)) so that 
and k(l) both go to infinity with pass to a convergent subsequence, and call the 
limit P. Then because liminf k{l) = oo, P is a solution to the martingale problem 
corresponding to L until d/j. for every positive integer k. Since C,\/k / C as & — * oo, 
it follows that P is a solution to the martingale problem corresponding to L until 
Q. The proof of the existence of a solution to the martingale problem for L starting 
from any point (xo,yo) S (M x N) \ {r = 0} and stopped at the first hitting time 
of {r = 0} is complete. 

To prove the final assertion of the theorem, we recall that L is smooth on the 
complement of So- Thus the martingale problem corresponding to L has a unique 
solution until the first hitting time of So- By uniqueness, the solution just con- 
structed (and, moreover, any solution to the martingale problem for L) must agree 
with this solution until the first hitting time of So- □ 

5. Applications of the coupling 

5.1. Strong halfspace-type theorems. A strong halfspace theorem states that 
two minimal surfaces, satisfying some condition, either intersect or are parallel 
planes. Hoffman and Meeks [I] proved a strong halfspace theorem for (complete) 
properly immersed minimal surfaces. Their proof used geometric measure theory 
to show that two such non-intersecting minimal surfaces are separated by a stable 
minimal surface, which must then be a plane (since planes are the only stable min- 
imal surfaces in K 3 by a result of Schoen [13]). This reduces the problem to the 
corresponding weak halfspace theorem. Rosenberg [12] proved a strong halfspace 
theorem for complete minimal surfaces of bounded curvature, as did Bessa, Jorge, 
and Oliveira-Filho [1] (this later paper also gives a "mixed" strong halfspace theo- 
rem in which one minimal surface is properly immersed and the other is complete 
with bounded curvature). 

The ultimate goal of introducing our coupled Brownian motions is to show that 
the particles couple, either with positive probability or with probability one. If 
the particles couple with positive probability, then the minimal surfaces on which 
they move obviously intersect. This gives a potential method for proving strong 
halfspace theorems or similar results. The issue is proving that the particles couple. 
The distance between the particles, under our coupling, is dominated, after time 
change, by a two-dimensional Bessel process. Recall that a two-dimensional Bessel 
process comes arbitrarily close to zero, while a Bessel process of dimension less than 
two strikes zero in finite time almost surely. Thus, heuristically, we see that the 
only obstacles to our particles coupling is that the distance between them might 
converge, corresponding to the process accumulating only finite quadratic variation, 
or that the the process might look too much like a two-dimensional Bessel process 
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when the particles are close, causing them to come arbitrarily close but never to 
couple. The rate at which the quadratic variation grows and the ratio of the drift 
to the dispersion are both determined by the configuration of the tangent planes, 
that is, by the angles 9, tp, and ip introduced above. 

Unfortunately, it's not clear how to get the necessary control of the evolution of 
these angles to prove the full strong halfspace theorem for either properly immersed 
or bounded curvature minimal surfaces. Instead, we have the following partial 
result. 

Theorem 8. Let M be minimal surface that is either recurrent or stochastically 
complete with bounded curvature, and let N be a stochastically complete minimal 
surface. Then if M is not flat, dist (M,N) = 0. 

Proof: Consider Brownian motion on M and N, coupled as described above 
(for arbitrary starting points), and assume that M is not flat. Then the particles 
will become arbitrarily close (whether or not they meet) as long as the distance 
between them does not converge to some positive limit. To show that this cannot 
happen, we proceed by contradiction. Assume that, with positive probability, Tt 
converges to a positive value (and thus that the process continues for all time). 
The vector x% — yt is an R 3 -valued martingale, and it is easy to see that its length 
can only converge if the vector itself converges. This, in turn, means that the 
direction of the vector in § 2 must converge. However, because M is recurrent 
or stochastically complete with bounded curvature and is not flat, we know from 
Theorem [1] that, up to a set of probability zero, any path that continues for all time 
has a normal vector that spends an infinite amount of time in every open set of S 2 . 
We conclude that, along such a path, the system spends an infinite amount of time 
where 8 £ (37r/4, tt/2). The rate of growth of quadratic variation is bounded from 
below by a positive constant on this set, and thus the r t -process must accumulate 
infinite quadratic variation. This contradicts our assumption that it converges, and 
the proof is complete. □ 

Note that this proof shows that any Brownian motion on M almost surely be- 
comes arbitrarily close to N and vice versa. Further, in contrast to Rosenberg's 
result, only one of the minimal surfaces needs to be stochastically complete and 
have bounded curvature (or be recurrent); the other need only be stochastically 
complete. On the other hand, the weakness of this theorem is obvious. We have 
only shown that the distance between the surfaces is zero, not that they actually 
intersect. As the proof makes clear, the difficulty is controlling the process for 
small r, to rule out both the possibility that r converges to zero and the possibility 
that it becomes arbitrarily close to zero without ever hitting it. One might hope 
that better understanding of the process for small r would allow the theorem to be 
strengthened to conclude that the surfaces intersect. 

In the properly immersed case, we don't even have the analogue of Theorem [S] 
This is a consequence of our inability to control the long-term behavior of the normal 
vector to any extent greater than that implied by the weak halfspace theorem. 

5.2. Maximum principle at infinity. In a more positive vein, one nice feature of 
the use of coupled Brownian motions is that one expects results to extend naturally 
to the case of minimal surfaces with boundary, as mentioned above. By a minimal 
surface with boundary, we mean a surface with boundary together with an immer- 
sion which is minimal on the interior and extends continuously to the boundary. 
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Our approach requires extending Theorem [Jj to the case when one or both of M 
and N are allowed to have boundary. To do this, assume that M has non-empty 
boundary. Then it is fairly straightforward to show that the first hitting time of 
the boundary of M is almost surely continuous, with respect to Brownian motion 
on M, on the set where it is finite. The same is true of N in case it has non-empty 
boundary. The first hitting time of the boundary of M x N, which we denote n, 
is the minimum of the first hitting times of the boundaries of M and N. All of 
the processes P^'' fc ) that we introduced in the proof of Theorem [7] and all of their 
weak limits have marginals that are time-changed (by a constant factor) Brownian 
motion, so s A r\ and t A n (for fixed times < s < t) are bounded and almost surely 
continuous with respect to all of these measures. Because of this boundedness and 
almost sure continuity, the weak convergence argument we used in the proof of 
Theorem [JJ to show that the limit measure is a solution to the martingale problem 
for L is compatible with stopping all of the processes at the boundary. Thus we 
see that that we can solve the martingale problem for L, stopped at the boundary, 
and this gives an adequate coupling, stopped at the boundary, in the case when 
one or both of our minimal surfaces has boundary. (The reason we had to use an 
approximation argument earlier when stopping the process at £ is because it's not 
clear that £ is almost surely continuous.) 

The model theorem from (non-stochastic) geometric analysis is the following 
version of the maximum principle at infinity, recently proved by Meeks and Rosen- 
berg 0. 

Theorem 9. Let M and N be disjoint, complete, properly immersed minimal 
surfaces-with-boundary, at least one of which has non-empty boundary. Then the 
distance between them satisfies 

dist(M, N) = min{dist(M, dN), dist(9M, N)}. 

This is a generalization of the strong halfspace theorem for properly immersed 
minimal surfaces. It is proved by similar methods, although the addition of the 
boundary makes things more difficult. We expect the analogue in the bounded 
curvature case to be true, although to our knowledge no (previous) work has been 
done in that direction. However, we have the following version of the maximum 
principle at infinity for minimal surfaces-with-boundary of bounded curvature. 

Theorem 10. Let M and N be stochastically complete minimal surfaces-with- 
boundary, at least one of which has non-empty boundary, such that dist(Af, N) > 0. 
If M has bounded curvature or is recurrent, and is not flat, then 

dist(M, N) = min{dist(A/, ON), dist(<9M, N)}. 

Proof: Suppose M and N satisfy the hypotheses of the theorem, and that 
dist(Af, N) = a > 0. Consider any point (xo, yo) m the interior of M x N. We run 
a coupled Brownian motion starting at (xq, yo), stopped when it hits the boundary. 
With one caveat, it is clear from our proof of Theorem [8] that r t almost surely hits 
the boundary in finite time, having accumulated only finite quadratic variation, 
since otherwise the process would hit a level below a with positive probability. The 
caveat is that the proof of Theorem [8] uses Theorem Q] which we have not proved 
for surfaces-with-boundary. However, we can prove that any Brownian path on M 
with an infinite lifetime has a normal vector which accumulates infinite occupation 
time in every open set of S 2 , up to a set of probability zero, as follows. If M is 
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recurrent this is clear for the same reasons as before, so assume that M is transient 
(meaning the interior of M is transient). Let M be the universal cover of the in- 
terior of M, so that M is conformally equivalent to the unit disk. Then M can be 
described by Weierstrass data just as before, and M also has bounded curvature. 
Further, we see that the argument in the proof of Theorem [T] can now be applied 
to the set of paths with infinite lifetime, so we conclude that any such path has a 
normal vector which accumulates infinite occupation time in every open set of S 2 , 
up to a set of probability zero. This establishes our claim that r t almost surely hits 
the boundary of M X N in finite time. 

To complete the proof, first suppose that, with positive probability, r t accumu- 
lates no quadratic variation. It follows that this set of paths produces points (x, y) 
on the boundary with dist(x,y) = dist(xo, yo). For the other case, suppose that r t 
almost surely accumulates positive quadratic variation. Then, by comparison with 
a two-dimensional Bessel process, there is some e > such that r t hits the level 
dist(xo, yo) — e with probability at least 1/2. Since the process must stop before rt 
gets below level a, comparison with a two-dimensional Bessel process shows that 
there is a positive probability that the paths along which rt hits dist(xo, yo) — e are 
stopped at the boundary before r t increases to dist(xo, yo) — e/2. This produces 
points (x,y) on the boundary with dist(x, y) < dist(xo, yo)- 

It follows from the above that, for any point (xo, yo) in the interior of M x N, 
there is a point (x, y) on the boundary of M x N such that dist(x, y) < dist(xo, yo). 
We conclude that dist(M, N) = min{dist(M, dN), dist(<9M, N)}, and the theorem 
is proved. □ 

5.3. Liouville theorems. In the previous section, we coupled Brownian motions 
on two different surfaces in an attempt to control the distance between these sur- 
faces. However, one can also consider coupling two Brownian motions started at 
different points on the same surface. This gives an approach to proving that there 
are no non-constant bounded harmonic functions on certain classes of minimal sur- 
faces. In particular, suppose that M is an embedded (by which we mean injectively 
immersed) minimal surface such that Brownian motions started from any two points 
couple almost surely. By embeddedness, the fact that they couple in the extrinsic 
distance also means that they couple in the intrinsic distance. Then the standard 
representation of harmonic functions as integrals of Brownian motion with respect 
to bounded stopping times shows that any bounded harmonic function must be 
constant. 

Our efforts in this direction are guided by the following conjecture, which appears 
to go back to Sullivan (see Conjecture 1.6 of [7] and the surrounding discussion). 

Conjecture 11. A complete, properly embedded minimal surface admits no non- 
constant, positive harmonic functions. 

Though the full conjecture remains open, various special cases are known. Of 
course, any class of surfaces which are recurrent satisfies the theorem. For example, 
Theorem 3.5 of [2] states that any complete, properly embedded minimal surface 
with two limit ends (see the introduction of the paper just cited for a discussion of 
ends and limit ends) is recurrent. As for results that apply to transient surfaces, 
in [9], Meeks, Perez, and Ros prove the conjecture under the additional assumption 
that the surface possesses one of various symmetries (such as being doubly or triply 
periodic, or having a sufficiently large isometry group). Their method is to look 
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at the universal cover, which must be conformally equivalent to the unit disk, and 
show that the group of Deck transformations must be so large that it can leave only 
constant positive harmonic functions invariant. 

We provide another partial result, under the additional assumption that M has 
bounded curvature. Also note that, as indicated above, our result only prohibits 
non-constant bounded (rather than positive) harmonic functions. 

Theorem 12. Let M be a complete, properly embedded minimal surface of bounded 
curvature. Then M has no non-constant bounded harmonic functions. 

Proof: Choose any two distinct points xo and yo i n M, and run our adequate 
coupling of Brownian motions from these points. Our proof of Theorem [8] shows 
that Tt either hits zero in finite time or else spends an infinite amount of time in 
every neighborhood of zero. To prove the theorem, it is enough to prove that rt 
almost surely hits zero in finite time, as discussed above. 

As a consequence of their maximum principle at infinity (which we stated above), 
Meeks and Rosenberg were also able to prove that any properly embedded minimal 
surface with bounded Gauss curvature has a fixed size tubular neighborhood (see 
the first paragraph of Section 5 of [8] along with Theorem 5.3). With bounded 
curvature, the existence of such a tubular neighborhood implies that there is some 
a > such that, whenever r t = r{x t ,yt) < a, the distance between x t and y t with 
respect to the metric on M, which we denote distM^t, yt), is less than or equal to 
2a. Further, if this property holds for some particular a, then it also holds for any 
smaller a. 

Because the curvature is bounded and the embedding is minimal, the entire 
second fundamental form of the embedding is uniformly bounded. This means 
that any fixed sized (with respect to the metric on M) neighborhood of a point is 
uniformly comparable to the tangent plane at that point. More concretely, for any 
e, we can choose a small enough so that, whenever dist M(x,y) < 2a, the resulting 
configuration satisfies 
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One consequence of this is that, by choosing a small enough, we can guarantee that 
the set of points with r < 2a is disjoint from So- bi fact, because we also know 
that the curvature is bounded, the subset of M X N where r < 2a is a positive 
distance (in the product metric) from So. Thus we can assume that our modified 
coupling, given by L, agrees with the optimal coupling (given by L) on the set 
where r < 2a. Under the optimal coupling, 6 — ip — tt/2, ip = corresponds to rt 
evolving like the standard mirror coupling in the plane, dr t = 2 dB t . So by taking 
small enough e and a, the configuration can be made arbitrarily close to that of 
the standard mirror coupling on the plane, which is just time-changed Brownian 
motion, whenever distjvf(xt, yt) < 2a. The other consequence of the fact that the 
set {r < 2a} is disjoint from S that we need is that the (x t , yt) -process, started at 
any point in {(x,y) : r(x,y) < 2a}, is unique at least until its first exit time from 
this set. In particular, the process has the strong Markov property on any time 
interval during which it is contained in this set. 

The above shows that the rate of growth of quadratic variation of r t is bounded 
from below on {(x, y) : r(x, y) < 2a}, so if r t doesn't strike zero in finite time, it hits 
the level a and then leaves the set {(x, y) : r(x, y) < 2a} infinitely many times. The 
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point now is to argue that each time rt hits a it has some probability, bounded from 
below, of hitting before it hits 2a. Then we wish to use the (almost) independence 
of these events to conclude that, almost surely, on one of these occasions r t must 
hit 0. As indicated above, this proves the theorem. The rest of the proof is devoted 
to making this argument precise. 

Note that a one-dimensional Bessel process (which corresponds to 6 = ip = n/2, 
ip = 0) has the property that, if started at some I, it hits zero before it hits 21 with 
probability 1/2. In light of the above, we can choose a so that whenever r t < 2a, 
r t is dominated by a time-changed Bessel process of dimension d, where d is such 
that a Bessel process of dimension d started at a strikes zero before 2a with some 
probability p > 0, and so that the time-change satisfies the estimate dr/dt > 3/4. 
Let t\ be the first time that r t hits the level a, and let be the comparison 
Bessel process started from level a at time t\; 

p% = a + W T{t) - W T(tl) + f ^ ^1 ds, r(t) - f fds, 

Mti) 2p\( s) Jo 

where W T is a Brownian motion. 

As usual, we stop (x y ,y t ) and if r t hits zero. Let <j\ be the first time after 

ti that p^h. hits 2a; by convention, a\ = oo if r t strikes zero before p^l. hits 2a. 
We now iterate this procedure. For n > 2, let t n be the first time after a n -i that 

(n) 

r t hits a, and let a n be the first time after t n that either r t hits or p^,L hits 2a. 

Here p^u-i is defined by the same equation as p^L , except that it is begun at from 

level a at time t n . Now consider the processes {pjuyt n <t< t n+ {\. 

Recall that, starting from any point with r — a, the (x t , yt) -process is unique 
at least until r* hits 2a. Using this and the fact that the [t n ,t n +i] are disjoint, we 
see that this collection of processes is independent, and that each one enjoys the 
strong Markov property during its lifetime. Thus, the probability that a n is finite 
(that is, the probability that r t drops to level a and escapes back up to 2a without 
striking zero n times) is less that (1 — p) n . Since this goes to zero as n — > oo, we 
see that, almost surely, r t hits zero in finite time, completing the proof. □ 
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